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AMENDMENT TO THE SPECIFICATION : 

On page 16, beginning at line 1 1 through page 17 ending at line 7, delete the current 
paragraph number and substitute the following: 

A preferred example of an algorithm that is suitable for determining percent sequence 
identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described 
in Altschul et al., Nuc. Acids Res. 25:3389-3402 (1977)-and Altschul et al., J. Mol. Biol. 
215:403-410 (1990), respectively. Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul et al., Altschul et al., Nuc. Acids 
Res. 25:3389-3402 (1977) and Altschul et al., J. Mol. Biol. 215:403-410 (1990)). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. 
The word hits are extended in both directions along each sequence for as far as the cumulative 
alignment score can be increased. Cumulative scores are calculated using, for nucleotide 
sequences, the parameters M (reward score for a pair of matching residues; always >0) and N 
(penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix 
is used to calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum achieved 
value; the cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or the end of either sequence is reached. The BLAST 
algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The 
BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an 
expectation (E) or 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, 
the BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 
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(1989)) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both 
strands. 
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